2 research outputs found

    Parallel computation of the hyperbolic QR factorization

    In this thesis we presented how to compute the hyperbolic ($J$-unitary) QR factorization. First, the theory was laid out, giving two ways of reducing a matrix $G \in \mathbb{C}^{m \times n}$, $m \geq n$, to block upper triangular form. One way reduces a single column with a $J$-unitary Householder-like reflector; the necessary and sufficient conditions for the existence of such reflectors were established. The other way reduces two columns at a time using Givens rotations. In that chapter the notion of a proper form was defined, together with how matrices are brought to proper form and how proper forms are then fully reduced by $J$-unitary matrices of smaller dimensions. Furthermore, we related the indefinite QR factorization to the Hermitian indefinite factorization and presented the optimal pivoting strategy for the latter: the one with the smallest pivot growth in every case, regardless of whether one or two columns are chosen as pivotal. The same pivoting strategy was then applied to the QR factorization. Finally, a sequential algorithm for reducing the matrix $G$ to block upper triangular form was presented, together with the parts of it that were parallelized. While optimizing the code, the memory architecture of the machine was taken into account, as was the way the OpenMP and MKL libraries used for parallelization operate. Tests on randomly generated matrices were performed on Intel's Xeon Phi 7210, whose special memory architecture was also taken into account.
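    The single-column reduction described above can be illustrated with a minimal NumPy sketch of a $J$-unitary Householder-like reflector $H = I - \frac{2}{v^{\ast} J v}\, v v^{\ast} J$, which satisfies $H^{\ast} J H = J$ and maps a column $x$ to a multiple of $e_1$ whenever the existence condition $\operatorname{sign}(x^{\ast} J x) = j_1$ holds. This is only a sketch under those assumptions; the function name and the unblocked, dense construction are illustrative and are not the thesis's blocked, pivoted OpenMP/MKL implementation.

        import numpy as np

        def j_householder(x, j):
            """Build a J-unitary Householder-like reflector H (H^* J H = J, J = diag(j),
            j_i = +-1) such that H @ x is a multiple of the first unit vector e_1.
            Illustrative, unblocked sketch only."""
            x = np.asarray(x, dtype=complex)
            j = np.asarray(j, dtype=float)
            xJx = np.real(x.conj() @ (j * x))        # x^* J x, always real
            if xJx * j[0] <= 0.0:
                # existence condition: sign(x^* J x) must equal the first sign in J
                raise ValueError("no J-Householder reflector reduces this column")
            phase = x[0] / abs(x[0]) if x[0] != 0 else 1.0
            beta = -phase * np.sqrt(xJx / j[0])      # sign chosen to avoid cancellation
            v = x.copy()
            v[0] -= beta
            alpha = np.real(v.conj() @ (j * v))      # v^* J v
            H = np.eye(x.size, dtype=complex) - (2.0 / alpha) * np.outer(v, v.conj() * j)
            return H, beta

        # Small check: H is J-unitary and annihilates x below its first entry.
        j = np.array([1.0, -1.0, 1.0])
        x = np.array([3.0, 1.0, 1.0])
        H, beta = j_householder(x, j)
        assert np.allclose(H.conj().T @ np.diag(j) @ H, np.diag(j))
        assert np.allclose(H @ x, beta * np.array([1.0, 0.0, 0.0]))

    In the thesis this building block is combined with the two-column Givens reduction, the pivoting strategy, and blocking; the sketch shows only the single-column step.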

    The LAPW method with eigendecomposition based on the Hari--Zimmermann generalized hyperbolic SVD

    In this paper we propose an accurate, highly parallel algorithm for the generalized eigendecomposition of a matrix pair $(H, S)$, given in a factored form $(F^{\ast} J F, G^{\ast} G)$. Matrices $H$ and $S$ are in general complex and Hermitian, and $S$ is positive definite. Matrices of this type arise when the Hamiltonian of a quantum mechanical system is represented in terms of an overcomplete set of basis functions. This expansion is part of a class of models within the broad field of Density Functional Theory, which is considered the gold standard in condensed matter physics. The overall algorithm consists of four phases, the second and the fourth being optional, where the last two phases are the computation of the generalized hyperbolic SVD of a complex matrix pair $(F, G)$ with respect to a given matrix $J$ defining the hyperbolic scalar product. If $J = I$, then these two phases compute the GSVD in parallel, very accurately and efficiently.

    Comment: The supplementary material is available at https://web.math.pmf.unizg.hr/mfbda/papers/sm-SISC.pdf due to its size. This revised manuscript is currently being considered for publication.
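    The connection between the last two phases and the generalized eigendecomposition can be made explicit with a short derivation. It assumes the usual form of the generalized hyperbolic SVD, $F = U \Sigma_F Z$ and $G = V \Sigma_G Z$ with $U^{\ast} J U = J$, $V^{\ast} V = I$, $Z$ nonsingular, and $\Sigma_F$, $\Sigma_G$ diagonal with nonnegative entries; the exact normalization used by the Hari--Zimmermann algorithm may differ, so this is a sketch rather than a statement of the paper's method:

        H = F^{\ast} J F = Z^{\ast} \Sigma_F^{\ast} (U^{\ast} J U) \Sigma_F Z
                         = Z^{\ast} (\Sigma_F^{\ast} J \Sigma_F) Z,
        S = G^{\ast} G   = Z^{\ast} \Sigma_G^{\ast} \Sigma_G Z.

    Hence the $i$-th column $z_i$ of $Z^{-1}$ satisfies $H z_i = \lambda_i S z_i$ with $\lambda_i = j_i\, \sigma_{F,i}^2 / \sigma_{G,i}^2$, where $j_i$ is the $i$-th diagonal entry of $J$ and $\sigma_{F,i}$, $\sigma_{G,i}$ are the diagonal entries of $\Sigma_F$ and $\Sigma_G$. For $J = I$ this reduces to the familiar GSVD relation for the definite pencil $(F^{\ast} F, G^{\ast} G)$.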